Technical Report No. 32-445 

Sequential Estimation of Correlated 
Stochastic Variables 

C. G. Pfeiffer 



JET PROPULSION LABORATORY 
California Institute of Technology 
Pasadena, California 


July 1, 1963 


k.O 



Technical Report No. 32-445 


Sequential Estimation of Correlated 
Stochastic Variables 

C. 6. Pfeiffer 



T. W. Hamilton, Chief 
Systems Analysis Section 


JET PROPULSION LABORATORY 
CALIFORNIA INSTITUTE OF TECHNOLOGY 

Pasadena, California 


July 1, 1963 



Copyright © 1963 
Jet Propulsion Laboratory 
California Institute of Technology 

Prepared Under Contract No. NAS 7-100 
National Aeronautics & Space Administration 



JPL TECHNICAL REPORT NO. 32-445 


CONTENTS 


I. Summary 

II. The Gauss-Markoff Theorem 

III. Sequential Estimation 

IV. Proof of the Fundamental Theorem 

V. Examples of Sequentially Correlated Processes 

VI. Discussion of Kalman’s Model 

VII. Conclusion 

Nomenclature 

References 


FIGURES 


1 . The stochastic process 

2. The sequentially correlated scintillation noise 

3. The sequentially correlated linear system . . . 


Ill 



JPL TECHNICAL REPORT NO. 32-445 


ABSTRACT 

A technique is developed for estimating the components of the se- 
quence of correlated random vectors {x x , x 2 , •••, x„, ••• }, when given 
the sequence of linearly related data vectors {pi, p 2 , '** ,p„, ••• }. It is 
shown that a necessary and sufficient condition that the minimum 
variance estimate of x n depend only upon p„ and the previously com- 
puted estimate of x n i is that all x, be sequentially correlated. This 
is a condition placed upon the covariance matrix describing the other- 
wise unspecified physical process that generates the x s . Examples 
of sequentially correlated processes are given, and an application to 
the deep-space orbit determination problem is discussed. 


I. SUMMARY’ 


This Report develops a technique for sequentially ob- 
taining the minimum variance estimate of the components 
of the vectors Xj in the time-ordered sequence of corre- 
lated stochastic vectors 

(Xij X 2 , j ^n> } 

when given observed data linearly related to the Xi. The 
stochastic vectors X* are the values x(tj) at the discrete 
times 

{ti, t 2 , ’ * *, t n> • * * | 

where x(t) is the output of some unspecified physical 
process. It is assumed that 

x"i — 0 

and that the correlations 

[xi x T { ] = Ai 


and 

[Xj X^ = Fji 

are known. The "noise” on the observations is considered 
to be imbedded in the x t , thus the observations are of 
the form 

01 = 0(ti) = Ai Xj 
where Ai is a known matrix. 


’Notation: All random variables are assumed to be column vectors 
which are denoted by boldface type. Matrices are denoted by ital- 
icized capital letters; I is the identity matrix. The superscript T 
indicates the transpose of a matrix or vector, and the subscript i 
refers to the time ti. The bar ( — ) over a quantity indicates the 
statistical average over the ensemble of all experiments. The asterisk 
indicates the minimum variance estimate of the quantity xi, con- 
sidering all data up to and including time ti, while the ^ indicates 
the estimate of the x ( based only upon xi_,. See the Nomenclature 
for definitions of symbols. 
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Since the data are gathered and estimates are made 
only at the discrete times t i? the Gauss -Markoff theorem 
can be applied to yield a minimum variance estimate of 
any Xi. This estimate depends, in general, upon all ob- 
served data up to and including time t { . The situation 
becomes unwieldly if a large number of observations are 
gathered, but Kalman (Ref. 1) shows that a relatively 
simple iterative estimation procedure can be performed 
if it is postulated that each bit of observed data arises 
from sampling the output of a linear system excited by 
uncorrelated, Gaussian error sources (see Part V, the 
linear control process). In this case, only the statistics of 
the most recent estimate of the state of the system need 
be considered when processing each new data point, 
making the situation much more tractable. It is the pur- 
pose here to develop a similar iterative estimation tech- 
nique without constructing such a system model, thereby 
demonstrating the most general type of stochastic process 
that can be treated in this fashion and producing estima- 
tion formulas that do not depend upon dealing with 
uncorrelated error sources. 

Given the sequence of data vectors 

(0!, 0,, •••,0„, •••} 

where 

0 i — Ai X [ 

it is shown that a necessary and sufficient condition that 
the minimum variance estimate of Xj depend only upon 
the observation 0 3 and the previous estimate x* is that all 
Xi be “sequentially correlated”, that is, 

R ki = R kj Rji for all t t < tj <t k (1) 


where 

Rji = Pji A - 1 


is defined to be the * normalized correlation” between Xj 
and Xj. In this case, the estimate of xj, given the previous 
estimate xf but not the observation 0j, is 



Xj = flj, xf 

(2) 

Defining the errors 

1! 

?> 

1 

X 


and 

«l = (xf — X, ) 


then 



[Wj = 

Aj - Rji JA, - [c~ T ][ «!. 

(3) 


The minimum variance estimate of Xj then becomes 

x* - ^ -f- Wj(0j - AA) (4) 


where 

W,=[Mt] A] [Aj [liij] aj]' 1 (5) 

The covariance of the error in xf is 

pi] = pi] - WjAj p*] (6) 

The process is repeated at time t k > tj, and the estima- 
tion proceeds iteratively to conclusion. 

Examples of sequentially correlated processes are 
given, and the relation to Kalman s result is discussed. 


II- THE GAUSS-MARKOFF THEOREM 


The sequential estimation technique discussed in this 
Report is developed from the well-known Gauss-Markoff 
theorem (Ref. 2). In this Part, a general form of the 
theorem will be presented. 

Suppose a data vector 0 is observed on some experi- 
ment, and the value of some unknown random vector x 
is to be estimated from the given 0, where 0 and x have 
known statistical correlation over all such experiments. 


It will be assumed that these quantities have zero a priori 
mean, that is, 0=0 and x = 0. The specification of a 
linear, unbiased minimum variance estimate leads to a 
unique estimate of x. Let this estimate be 

x* = K0 

where K is a matrix to be determined. Note that x* is 
unbiased, that is, 

x* = 0 
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DEFINITION 1: The linear unbiased estimate x* is said 
to be a minimum variance estimate if the error covari- 
ance matrix [ee T ] is a minimum over all experiments, 
where 

e = x* — x 


This means that for any other estimate 
x* = L0 

with error 

€l = (Xf - x) 

the quadratic form associated with { - [« T ]} 

is positive semidefinite. 


Theorem 1: The minimum variance estimate of x, given 

0, is 

x * = [i0*] [00*] 0 (7) 2 

The error in the x* has covariance 

[iF] = [SF] - [i^] [00^] 1 [0**] (8) 


Proof: Suppose the a priori covariance matrix A is given, 
where 


[00 T ] [0X T ] 


! 

-G- 

■©- 

< 

1 

[X0*] [XX T ] 


Px0 A x 


Let 


x* = K 0 


where the elements of K are to be determined. Then 


[€€ t ] - KA*K T - KP$ X - P X $K T + A x (10) 


Let z be an arbitrary vector. Then from Definition 1 it 


2 As in Eq. (7), it will be assumed throughout this Report that the 
inverse of all a priori covariance matrices exists. There is no loss of 
generality in this assumption, since the condition can always be 
realized in theory and in practice by eliminating the redundant 
(perfectly correlated) variables from the problem. 


follows that the quadratic form z T [e€ T ] z must be a 
minimum with respect to all elements of K. This implies 

0 = [KA+ - P x *] 8K + { [KA* - P*»] 8K} T (11) 

Thus 

K = P x * A- 1 (12) 

which will be defined as the “normalized correlation” 
between x and 0. Equation (8) follows from substituting 
Eq. (12) into Eq. (10). This completes the theorem. 

Theorem 1 is often presented in a different form when 
discussing the process (Ref. 3) 

0 — Ax + n (13) 

where x is constant over any given experiment and n is 
noise on the observations that is not correlated with x. 
If the a priori statistics over all experiments are 

[xx T ] = A 0 

and 

[=’]=f 

(F is usually assumed to be a diagonal matrix), then 


[x0 T ] = A„A T (14) 

[00*] = aa„a t + r ( 15 ) 

From Eq. (7) and Eq. (8), 

x* = A„A T [Aa„A t + T] 1 0 (16) 

[ii*] = Ao - A 0 A T [AA„A t + T ] -* Aa 0 (17) 

By a matrix identity (Ref. 4, p. 79), 

X* = [A- 1 + A T r-’A]-> [A T r-> 0] (18) 

[ii*] = [A- 1 TAT'A]- 1 (19) 

If A„— > oo (no a priori information on x), then Eq. (18) 
becomes 

X* = [AT 1 A] 1 [A T r-' 0] (20) 


which is the classical result. If T is diagonal with equal 
elements, Eq. (20) becomes the familiar least-squares 
estimate. 
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UK SEQUENTIAL ESTIMATION 


This Report is concerned with the estimation of certain 
time-varying vector random variables 

Xi = x(tj) 

when given the sequence of observed data vectors 
where 

0i = AjXi 

An iterative estimation procedure will be sought, where 
only the most recent estimate of the state of the system 
will be considered in processing each new data point. It 
will be shown in Part IV that this is possible if and only 
if the quantities to be estimated are “sequentially corre- 
lated.” An intuitive derivation of the result will be pre- 
sented here, and a more rigorous treatment will be given 
in Part IV. 

The motivation for this approach is the analysis of the 
orbit determination process (Ref. 3) in the presence of 
correlated measurement errors. The problem arises as 
follows. Let 

+ (21) 

where y* is the coordinate deviation vector at time t 5 and 
[ctyi/dyo] is the state transition matrix that relates the 
initial condition variations y 0 to y im Equation (19) is 
easily put into the form of Eq. (13) by constructing the 
total observation vector 

The dimension of <f> = (number of data points/sample 
time) X (number of sample times), but the estimate 
(Eq. 19) is quite tractable if the noise on the data is un- 
correlated. If this is not the case, however, it is convenient 
to define 



and thus 

0i = Ai Xi 


which is the problem formulation discussed above. 

With this example in mind, consider a more general 
process (see Fig. 1), where observations are being made 
during an experiment in an effort to estimate the cor- 
related stochastic variables 

Xi,X 2 , 

The first moment of the stochastic process is given by 

Xi = 0 

and the second moment by 


[x jX T] = Pji 



TIME 

Fig. 1 . The stochastic process 

It will be convenient to define 


Rj i = Pj iAy 1 

Suppose an estimate xf has been obtained, and, ignoring 
all other information, an estimate of Xj (tj > t 4 ) is to be 
made based on xf . From Eq. (7), this ought to be 

Xj = RjjX* (23) 

where Rj j is the * normalized” a priori correlation between 
Xi and Xj . The ^ notation indicates that new data at tj 
have not been considered in estimating Xj . The error 
expression is 

€j - (Xj - Xj) = Rji (Xi 4- e,) - Xj (24) 
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If it is assumed that is not correlated with x 5 or x^ the 
covariance matrix ^ejtjJis gi yen by Eq. (3). Equations (4) 
through (6) now follow directly by an application of 
Theorem 1. In effect, this technique treats each xj as an 
observation of Xj and optimally combines the Xj and 
to obtain the estimate xf . 

Certain elementary special cases offer an intuitive check 
on this result: 

1. No a posteriori information on x s is available. Then 



2. No correlation exists between Xj and x*. Then 


Rji = 0 



and 

[Pl] = A, 

3. Perfect correlation exists between Xj and x 4 (they 
are the same quantities). Then 

Rji - / 

a * 

Xj = X* 

and 

[Pi] = [Pi] 

The conditions under which the sequential estimation 
technique can be applied will be developed in Part IV. 
It should be observed, however, that a sequential estimate 
can always be made in engineering applications, and the 
resulting error variance may be near minimum. 


IV. PROOF OF THE FUNDAMENTAL THEOREM 


In this Part, the results stated in the Summary will 
be established by proving the fundamental theorem of 
sequential estimation (Theorem 4). The derivation calls 
upon well-known “escalator methods' for iteratively in- 
verting symmetric matrices (Ref. 4). 


Theorem 2: The minimum variance estimate of x c , given 
(x a , x b ), depends only upon x b if and only if these quanti- 
ties are sequentially correlated. The estimate in this case is 

x c = Rcb X b (26) 

where 


DEFINITION 2: The stochastic vectors, x a , x b , x c , de- 
fined at times t a , t b , t c , respectively, are said to be sequen- 
tially correlated when 

Pc a = Pcb A, 1 Pba (25) 

for t a < t b < t c , where 

A b = [x b xj] 

Pea = [X c xj] 

and and P ba are similarly defined. 


Rcb = PcbA- 1 


Proof: Let the covariance matrix describing x a , x b , x c be 


A abc 



Pab 

A b 

Pcb 



From Eq. (7), 


(27) 
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Equation (28) reduces to 3 

x c = (P ra - Pcb A' 1 P ba ) (B)x A 

+ (“P C a + P cb A' 1 Pba) (B) (Pel, A^)x b (29) 

+ Peb A ’ 1 X b 

where 

B= [PabA-P ba - A a ] (30) 


Proof: A stochastic process is said to be Markovian if the 
conditional probability of a future state depends only on 
the present state but not on the past history of the process. 
The process is said to be Gaussian if the x t are Gaussian. 
The minimum variance estimate completely determines 
the conditional distribution of the estimate if the random 
variables considered are chosen from Gaussian distri- 
butions, since the estimate then is Gaussian and the mean 
and variance are specified. This proves the corollary. 


(The existence of the B matrix is assumed, following Foot- 
note 2.) The theorem thus follows directly from Eq. (29). 

Theorem 3: A necessary and sufficient condition that the 
minimum variance estimate of x n+1 , given the time-ordered 
sequence 

(x 1} x a , -,x n } 


The difficulty remains that the x b i = 1, • * *, n, are usu- 
ally not given; only certain observations and the estimates 
of the Xi that follow from them are available. Theorem 3 
can be applied if the 0* or the x* are sequentially cor- 
related, but a more useful result would place a condition 
on the parameters to be estimated. Theorem 4 is the 
fundamental theorem which yields this result for the case 
of observations that are linearly related to the parameters. 


depend only on x n is that x i5 x n , and x n+1 be sequentially 
correlated for all i<n. The estimate in this case is 


Proof: Let 


and 


Xn+1 Pn+l,nX n 

K =[*7> X J» •••><-.] 

X b = X n 


(31) 


Theorem 4; (The fundamental theorem of sequential esti- 
mation.) Let the observations (0 n ) taken up to and includ- 
ing time t n be linearly related to the [x t • • • x n j, i.e., 


0 n = A n z n 


At 0 

0 a 2 


0 0 


0 

0 

A n 


(36) 


where 


Then 

Pea ~ 

Xe X n+1 

[xTx^] = [P„ + 1,1, P 0 + I.s, •••, Pn.l,n-l] 

(32) 

z" T = [x*,xj, -,xj] 

and A n is a matrix with nonrandom elements. Then the 
minimum variance estimate of x n+1 , given the minimum 
variance estimates 

Pcb ~ 

[x c Xj] = Pntl.n 

(33) 

{xf.x*, 

Pba “ 

[x^X^] = [P„,i,Pn,S, Pn.n-l] 

(34) 

is 

A 

A b = 

[^*n] = A » 

(35) 

X n+ i = Rn + l,n X J (37) 

if and only if x n+1 , x n , and Xj are sequentially correlated for 

The proof now follows from Theorem 2 and the definition 
of sequential correlation. 

i < n. The covariance of the error in x n+1 is 

= A n.i — Rn.i,n | A„ - [e„€j]|RLl.n 


A sequentially correlated process is a generalized 
Markoff process, as the following corrollary points out: 

Corollary: A Gaussian stochastic process is Markovian if 
and only if the correlation is sequential. 


(38) 


Proof: Let x n+1 be the estimate based on data up to, but 
not including, time t n +i. Then 


x n+ i = [x n+1 0 nT ] [0 n 0 nT ]" 1 0 n 


3 From Ref. 3, p. 78. 


= [x n+1 z nT ] [z" z nT ] [z n z nT ] [A nT ] [0 n 0 nT ] -1 0 n 
= [x n+1 z nT ] [z n z nT ] - 1 z n * (39) 
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But Eq. (39) is identical in form with the estimate based 
on the true z„; thus Eq. (37) follows directly from 
Theorems 2 and 3. 

Let 

^m-l + i + j (40) 

Then, from Theorem 1, 

[«n.l«Z*,] = An., — R„.l,„ [ X ? ***] RLl.n 

(41) 

But 

[ X H' x r] = [ x n0 nT ] [0 n 0 nT ] -1 [0" X I] =A„ - [e„€^] 

(42) 


where 

€ n = X* - X n (43) 

Equation (38) follows from Eq. (41) and (42). This com- 
pletes the theorem. 

Theorem 4 is the foundation for the sequential esti- 
mation procedure described in the Summary, for now 
Theorem 1 can be applied to improve the estimate x n+1 by 
the incorporation of the new data gathered at time t n +i. 
Note that the x* +1 so obtained is a minimum variance 
estimate for the data set { 0 1 , * * *,0 n+ i}, but not for a data 
set that includes the later measurements { 0 n + 2 , 0 n+ 3 , “*}• 
In most applications, however, it is sufficient to obtain a 
best estimate of only the most recent state of the system. 


V. EXAMPLES OF SEQUENTIALLY CORRELATED PROCESSES 


A sequentially correlated process can be defined by specifying the 
functional form of the normalized correlation matrices, or by describ- 
ing the physical system which generates the stochastic vectors x(t 4 ). 
Examples are given below: 

1. The general functional form. Let 

Rji — F(tj) F _1 (t») 

where F is an arbitrary non-singular matrix. It can be shown that 
Rji always has this functional form for a continuous sequentially 
correlated process. 

2. Perfectly correlated variables . Let x(t) be constant over any one 
experiment; then 

Rji =1 

3. Uncorrelated variables. Let x* and Xj be uncorrelated; then 

Rji = the matrix delta function 
(Equation 1 holds for ti < tj < t k .) 

4. Simple exponential correlation. Let each component of x(t) be 
exponentially correlated, and let the cross-correlations be zero; then 

exp {a(tj — ti)) 0 ■ 0 

0 exp {^(tj - ti)} 0 


0 0 •** exp {"(tj — ti)} 
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5. Orthogonal correlation. Let x(t) be orthogonally cor- 
related in two dimensions; then 

cos (tj — ti) —sin (tj — ti) 

sin (tj — ti) cos (tj — ti) 

It can be shown that a necessary and sufficient con- 
dition that R k i = Rkj Rji for all ti, t js t k (not just 
ti < tj < t k ) is that Rji be orthogonal. 

6. The stationary process. Let £ be a constant matrix, 
and let 

Rji = exp[(tj— ti)E] = 7 + (tj - ti)£ + ••■ 

+ (tj ~ ti) n E n + ... 
n! 

It can be shown that a necessary and sufficient con- 
dition that a continuous sequentially correlated proc- 
ess be stationary is that Rji be of this form (Ref. 5). 

7. General exponential correlation . Let the matrix C(t) 
have the commutative property C(tj) C(tj) = C(t ( ) 
C(tj), and take 

Rji = exp[C(t,) - C(t,)] 

(See Ref. 5 for a discussion of the matrix exponential.) 

8. Scintillation noise. (See Fig. 2.) The scalar signal s(t) 
is multiplied by the ra ndom gain g(t) to obtain 

= g( t) s(t), where g(t) = 1, s(t) = 0, and 
[s(ti) g(tj)] = 0. Let n(t) = g(t) - 1, and 




Rg. 2. The sequentially correlated scintillation noise 


Then 

*(t) = s(t) + s(t)n(t) = Ax(t) 

where A — [1, 1] . Thus x(t) is a sequentially corre- 
lated process if both s(t) and n(t) are sequentially 
correlated. 

9. The linear control process . 4 (See Fig. 3.) Let x(t) be 
derived from the process shown below, where 

[>k*T] = [nkx| ] [xjxf ] [xi*!] 

It can be shown that this is the most general noise 
model which will yield a sequentially correlated linear 
control process. 

4 This model is discussed in Part VI. 



Fig. 3. The sequentially correlated linear system 
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VI. DISCUSSION OF KALMAN’S MODEL 


The linear control process of Part V is a generalization 
of the model discussed by Kalman in Ref. 1, where he 
postulated 

[nin^] = 0 

for all i, j. It is easily verified that in this case Eq. (3) 
reduces to 


[^] = 17,, [«,er] C7J, +[»,■}] 

Note that a sequentially correlated process sampled at the 
discrete times t* can be put into Kalman's form by simply 
defining 


and 


t/j. = Rji 


[njiiJ] = [Aj - RjtAiRT] 


This replaces the true physical process with an artificial 
one that has random discontinuities at the selected sam- 
ple points ti. The covariance matrix of the “noise" on the 
artificial process has the diagonal form 


[""q 


[n,n;] 0 0 0 

0 [ n 2 n J] '* 0 ^ 


0 0 [n in T] •** 0 


t! t 2 ti 


With this construction in mind, the fundamental theorem 
of sequential estimation demonstrates the most general 
stochastic process that can be analyzed by Kalman s 
method. It provides a simple means for constructing his 
equivalent linear control process. 


VII. CONCLUSION 


An estimation procedure has been presented that is 
quite practical for obtaining numerical answers to many 
physical problems, and that provides a convenient ana- 
lytic framework for exploiting theoretical problems in 
control theory. The only description of the system needed 
is the mean and covariance matrix of the time-varying 
vector x(ti). The assumptions required are (1) the se- 
quential correlation condition on the covariance matrix 
of the Xi and (2) the linear relationship between the ob- 
servations 0i and the state vector x t . Since the statistical 
description of a physical system is often not well known, 
it is many times possible simply to postulate sequential 
correlation and be assured of obtaining a reasonable esti- 


mate. The extension of the result to include nonlinear 
relationships between 0i and Xj is not apparent. How- 
ever, such a generalization might not be very useful since 
the validity of a linear and unbiased estimate would be 
called into question. 

One of the limitations of this estimation technique is 
that it depends so strongly on the a priori correlations, 
which are often not well known. The estimation can be 
thought of as a weighting process, where each new data 
point is weighed against the previous (a priori) informa- 
tion. This can lead to an erroneous result if the a priori 
statistics are not correct. 
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NOMENCLATURE 

T superscript indicating the transpose of a matrix or vector 
— indicates the statistical average of a quantity over the ensemble 
of all experiments 

Scalars 

t time 

a, b, ■ , n, • subscripts referring to times t a , t b , * * * . The times are ordered 
according to the alphabetic sequence, thus t a < t b < 
x, y, z, <j> subscripts on a covariance matrix indicating to which random 
variables it applies 

8 denotes a variation in the indicated quantity 
L a special subscript referring to a non-optimum (not minimum 
variance) estimate 

a, /?, y, • • * the exponential decay constants (example 4) 
s a signal to be detected (example 8) 
g the gain multiplying s (example 8) 

Vectors 

x a column vector with random, time-varying components 
Xi the value of x at the time ti, thus x t = x(ti) 

an observation (data) taken at time t t , thus 0j — 0(t t ) 
x* the minimum variance estimate of x t based on all observations 
up to and including time t t 
the error in x* , thus e* = x* — Xi 

Xi the minimum variance estimate of X* based on all observations 
prior to, but not including, time tj 
Ci the error in x i? thus € x = Xi — x t 

z a dummy vector introduced to develop the proof of the Gauss- 
Markoff theorem 

z n a total random vector formed from all Xj up to and including 
time t n , thus z nT = •••,xj} 

0 n a total observation vector formed from all 0j up to and including 
time t n , thus 0 nT = (0^, 0| , *, 0*} 

n noise on an observation 

y\ the coordinate deviation vector at time t i? which is to be esti- 
mated by the orbit determination process 

Matrices 

7 the identity matrix 

A x the covariance matrix describing the second moment of the 
distribution of the random vector x 

A <p the covariance matrix similarly defined for the random vector <j> 
A i the covariance matrix similarly defined for the random vector Xi 
A 0 the covariance matrix similarly defined for the unknown param- 
eter vector x (Eq. 14, 15) 
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NOMENCLATURE (Cont’d) 


Pji the covariance matrix describing the linear correlation between 
the random vectors Xj and Xi 

R H the normalized correlation between Xj and Xj, given by Rji = 

PjiAj 1 2 3 4 5 6 

Wj the matrix which optimally weights each each new data point at 
the time tj 

A a matrix which relates observations to unknown parameters in the 
system (Eq. 13) 

r the covariance matrix of noise on the observations (Eq. 15) 

Ai a matrix which relates the observations <£i to the random vectors 

Xi 

A n a matrix formed from all A* matrices up to and including time 
t n (Eq. 36) 

B a combination of matrices that appears in the proof of Theorem 2 
C a matrix introduced in example 7 
K, L dummy matrices introduced in the proof of the Gauss- Markoff 
theorem 

E a constant matrix (example 6) 

E n indicates E multiplied by itself n times 

U ji the state transition matrix relating Xi to Xj in the linear control 
process (example 9) 

F an arbitrary non-singular matrix introduced in example 1 
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